Strategy Complexity of Concurrent Stochastic Games with Safety and Reachability Objectives

نویسندگان

  • Krishnendu Chatterjee
  • Kristoffer Arnsfelt Hansen
  • Rasmus Ibsen-Jensen
چکیده

We consider finite-state concurrent stochastic games, played by k ≥ 2 players for an infinite number of rounds, where in every round, each player simultaneously and independently of the other players chooses an action, whereafter the successor state is determined by a probability distribution given by the current state and the chosen actions. We consider reachability objectives that given a target set of states require that some state in the target set is visited, and the dual safety objectives that given a target set require that only states in the target set are visited. We are interested in the complexity of stationary strategies measured by their patience, which is defined as the inverse of the smallest non-zero probability employed. Our main results are as follows: We show that in two-player zero-sum concurrent stochastic games (with reachability objective for one player and the complementary safety objective for the other player): (i) the optimal bound on the patience of optimal and ǫ-optimal strategies, for both players is doubly exponential; and (ii) even in games with a single non-absorbing state exponential (in the number of actions) patience is necessary. In general we study the class of non-zero-sum games admitting ε-Nash equilibria. We show that if there is at least one player with reachability objective, then doubly-exponential patience is needed in general for ε-Nash equilibrium strategies, whereas in contrast if all players have safety objectives, then the optimal bound on patience for ε-Nash equilibrium strategies is only exponential.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Strategy Improvement for Concurrent Safety Games

We consider concurrent games played on graphs. At every round of the game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety objective: “stay forever in a set F of states”, and its dual, the reachability objective, “reach a set R of states”. We present in this paper a strategy improve...

متن کامل

Strategy Improvement for Concurrent Reachability and Safety Games

We consider concurrent games played on graphs. At every round of a game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety objective to stay forever in a given set of states, and its dual, the reachability objective to reach a given set of states. First, we present a simple proof of t...

متن کامل

Stochastic Equilibria under Imprecise Deviations in Terminal-Reward Concurrent Games

We study the existence of mixed-strategy equilibria in concurrent games played on graphs. While existence is guaranteed with safety objectives for each player, Nash equilibria need not exist when players are given arbitrary terminal-reward objectives, and their existence is undecidable with qualitative reachability objectives (and only three players). However, these results rely on the fact tha...

متن کامل

Termination criteria for solving concurrent safety and reachability games

We consider concurrent games played on graphs. At every round of a game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety objective to stay forever in a given set of states, and its dual, the reachability objective to reach a given set of states. We present in this paper a strategy i...

متن کامل

Semi-algebraic Tools in Stochastic Games

In this thesis we consider two-person zero-sum stochastic games with a special focus on how tools from the mathematical field of semi-algebraic geometry have been applied to these games. In the first of two parts of the thesis we introduce stochastic games and prove a complexity result about computing the value of a type of stochastic games called concurrent reachability games. We show that the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1506.02434  شماره 

صفحات  -

تاریخ انتشار 2015